MODNet: Moving Object Detection Network with Motion and Appearance for Autonomous Driving

نویسندگان

Mennatullah Siam

Heba Mahgoub

Mohamed Zahran

Senthil Yogamani

Martin Jägersand

Ahmad El Sallab

چکیده

For autonomous driving, moving objects like vehicles and pedestrians are of critical importance as they primarily influence the maneuvering and braking of the car. Typically, they are detected by motion segmentation of dense optical flow augmented by a CNN based object detector for capturing semantics. In this paper, our aim is to jointly model motion and appearance cues in a single convolutional network. We propose a novel two-stream architecture for joint learning of object detection and motion segmentation. We designed three different flavors of our network to establish systematic comparison. It is shown that the joint training of tasks significantly improves accuracy compared to training them independently. Although motion segmentation has relatively fewer data than vehicle detection. The shared fusion encoder benefits from the joint training to learn a generalized representation. We created our own publicly available dataset (KITTI MOD) by extending KITTI object detection to obtain static/moving annotations on the vehicles. We compared against MPNet as a baseline, which is the current state of the art for CNN-based motion detection. It is shown that the proposed two-stream architecture improves the mAP score by 21.5% in KITTI MOD. We also evaluated our algorithm on the non-automotive DAVIS dataset and obtained accuracy close to the state-of-the-art performance. The proposed network runs at 8 fps on a Titan X GPU using a basic VGG16 encoder.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Motion and Appearance Based Multi-Task Learning Network for Autonomous Driving

Autonomous driving has various visual perception tasks such as object detection, motion detection, depth estimation and flow estimation. Multi-task learning (MTL) has been successfully used for jointly estimating some of these tasks. Previous work was focused on utilizing appearance cues. In this paper, we address the gap of incorporating motion cues in a multi-task learning system. We propose ...

متن کامل

Motion and appearance based Multi-Task Learning network for autonomous driving

Autonomous driving has various visual perception tasks such as object detection, 1 motion detection, depth estimation and flow estimation. Multi-task learning (MTL) 2 has been successfully used for jointly estimating some of these tasks. Previous 3 work was focused on utilizing appearance cues. In this paper, we address the gap 4 of incorporating motion cues in a multi-task learning system. We ...

متن کامل

Motion detection by a moving observer using Kalman filter and neural network in soccer robot

In many autonomous mobile applications, robots must be capable of analyzing motion of moving objects in their environment. Duringmovement of robot the quality of images is affected by quakes of camera which cause high errors in image processing outputs. In thispaper, we propose a novel method to effectively overcome this problem using Neural Networks and Kalman Filtering theory. Thistechnique u...

متن کامل

Motion Detection from a Moving Observer Using Pure Feature Matching

Motion detection from a moving observer has been a very important technique for 3D dynamical image analysis, especially in the research of obstacle detection and tracking for autonomous driving systems and driver supporting systems. Because of the continuously changing background, detecting the real moving objects has become very difficult and always employs optical flow method to measure the f...

متن کامل

ODMAS : Object Discovery through Motion , Appearance and Shape

In this thesis we examine the problem of Object Discovery, the autonomous acquisition of object models, using a combination of shape, appearance and motion. We propose a new technique for detecting rigidly moving objects and constructing models of their appearance and shape called the ODMAS (Object Discovery through Motion, Appearance and Shape) system. Our technique is a multi-stage approach. ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1709.04821 شماره

صفحات -

تاریخ انتشار 2017

MODNet: Moving Object Detection Network with Motion and Appearance for Autonomous Driving

نویسندگان

چکیده

منابع مشابه

Motion and Appearance Based Multi-Task Learning Network for Autonomous Driving

Motion and appearance based Multi-Task Learning network for autonomous driving

Motion detection by a moving observer using Kalman filter and neural network in soccer robot

Motion Detection from a Moving Observer Using Pure Feature Matching

ODMAS : Object Discovery through Motion , Appearance and Shape

عنوان ژورنال:

اشتراک گذاری